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Abstract 

Leptospirosis, an emerging zoonotic disease with worldwide distribution, is caused by spiroclietes belonging to tlie genus 
Leptospira. More than 500,000 cases of severe leptospirosis are reported annually, with >10% of these being fatal. 
Leptospires can survive for weeks in suitably moist conditions before encountering a new host. Reservoir hosts, typically 
rodents, exhibit little to no signs of disease but shed large numbers of organisms in their urine. Transmission occurs when 
mucosal surfaces or abraded skin come into contact with infected urine or urine-contaminated water or soil. In humans, 
leptospires can cause a variety of clinical manifestations, ranging from asymptomatic or mild fever to severe icteric (Weil's) 
disease and pulmonary haemorrhage. Currently, little is known about how Leptospira persist within a reservoir host. Prior in 
vitro studies have suggested that leptospires alter their transcriptomic and proteomic profiles in response to environmental 
signals encountered during mammalian infection. However, no study has examined gene expression by leptospires within a 
mammalian host-adapted state. To obtain a more faithful representation of how leptospires respond to host-derived 
signals, we used RNA-Seq to compare the transcriptome of L. interrogans cultivated within dialysis membrane chambers 
(DMCs) implanted into the peritoneal cavities of rats with that of organisms grown in vitro. In addition to determining the 
relative expression levels of "core" housekeeping genes under both growth conditions, we identified 166 genes that are 
differentially-expressed by L interrogans in vivo. Our analyses highlight physiological aspects of host adaptation by 
leptospires relating to heme uptake and utilization. We also identified 1 1 novel non-coding transcripts that are candidate 
small regulatory RNAs. The DMC model provides a facile system for studying the transcriptional and antigenic changes 
associated with mammalian host-adaption, selection of targets for mutagenesis, and the identification of previously 
unrecognized virulence determinants. 
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Introduction 

Leptospirosis is a neglected disease of global significance [1,2]. 
Pathogenic leptospires, shed in animal urine or free-living within 
contaminated water, enter the host through small abrasions in the 
skin or contact with mucous membranes of the eyes, nose or 
throat. Organisms disseminate almost immediately following 
acquisition, travelling via the bloodstream to multiple tissues [3]. 



L. interrogans, an extracellular pathogen, is thought to penetrate 
host tissues by intercellular migration [4]. In immunocompetent 
hosts, the majority of leptospires are thought to be cleared by 
opsonophagocytosis following the appearance of specific antibod- 
ies [5]. However, organisms that reach the kidneys, an 
immunoprivUeged site [1], adhere to and colonize the proximal 
convoluted renal tubules, where they replicate exponentially. The 
majority of human disease is caused by Leptospira interrogans serovar 
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Author Summary 

Leptospirosis, a global disease caused by the unusual 
bacteriunn Leptospira, is transmitted fronn animals to 
humans. Pathogenic species of Leptospira are excreted in 
urine from infected animals and can continue to survive in 
suitable environments before coming into contact with a 
new reservoir or accidental host. Leptospires have an 
inherent ability to survive a wide range of conditions 
encountered in nature during transmission and within 
mammals. However, we know very little about the 
regulatory pathways and gene products that promote 
mammalian host adaptation and enable leptospires to 
establish infection. In this study, we used a novel system 
whereby leptospires are cultivated in dialysis membrane 
chambers implanted into the peritoneal cavities of rats to 
compare the gene expression profiles of mammalian host- 
adapted and in wfro-cultivated organisms. In addition to 
providing a facile system for studying the transcriptional 
and physiologic changes leptospires undergo during 
mammalian infection, our data provide a rational basis 
for selecting new targets for mutagenesis. 

(sv.) Copenhageni for which Rattus nowegicus serves as a reservoir 
host [3,6,7]. Experimentally-infected rats can excrete up to 10 
leptospires/ml of urine for months without clinical signs of 
infection, thus exemplifying the unique biological equilibrium that 
can exist between pathogen and reservoir host [8,9, 1 0] . 

The genome sequences of several pathogenic and saprophytic 
Leptospira spp., including L. interrogans sv. Copenhageni, are now 
complete [6,11,12,13,14,15,16]. L. interrogans sv. Copenhageni 
Fiocruz Ll-130 harbors 3728 protein-encoding genes [11,12]. By 
comparative genomics, Picardeau et at. [14] identified 1431 
"pathogen-specific" genes that are present within either or both 
of the pathogenic species, L. interrogans and L. borgpetersenii, but are 
absent from the free-living saprophyte L. biflexa. Although the 
majority (62%) of these pathogen-specific genes encode proteins of 
unknown function, it is possible that some are required by 
Leptospira to respond to unique environmental cues encountered 
within the mammalian host. Along these lines, the genome of L. 
interrogans contains >200 protein-coding sequences potentially 
involved in gene regulation, including gene products associated 
with two component signal transduction systems, alternate sigma 
factors, anti-sigma factors, and anti-sigma factor antagonists 
[11,12]. Not surprisingly, the pathogen-specific group also includes 
numerous gene products whose annotated functions or cellular 
location suggest a potential role in virulence-related processes such 
as adherence, digestion of host tissues and extracellular matrix, 
and evading the host's innate and adaptive immune responses 
[14,17]. 

To identify novel leptospiral virulence determinants, investiga- 
tors have manipulated in vitro growth conditions to simulate those 
encountered within the mammalian host, including increased 
temperature and/ or osmolarity, iron starvation, and the presence 
of serum [18,19,20,21,22]. However, the extent to which these in 
vitro conditions faithfully reproduce those encountered by Leptospira 
in vivo is unclear. In an effort to characterize leptospires in a truly 
mammalian host-adapted state, we cultivated virulent low-passage 
L. interrogans sv. Copenhageni within the peritoneal cavities of rats 
using a modification of our dialysis membrane chamber (DMC) 
model [23,24]. Given that rats are a natural reservoir host for this 
species ot Leptospira [2,25,26], we reasoned that this model would 
be ideal for this purpose. Originally developed to study host 
adaption by Lyme disease spirochetes (Borrelia burgdoiferi) [23,24], 



this technique, which uses dialysis membrane tubing with an 
8000 Da molecular weight cut-off, provides bacteria with access to 
host nutrients while protecting them from the host's cellular 
immune response. The DMC model has been instrumental in 
studying the contribution of mammalian host-specific signals to 
differential gene expression in B. burgdorfen on a genome-wide scale 
as well as enabling us to characterize the transcriptional and 
physiological changes integral to the mammalian host-adaptation 
process [23,24,27,28,29]. 

In recent years, high-throughput RNA sequencing (RNA-Secj) 
has replaced microarrays as the method of choice for genome- wide 
transcriptional profiling in bacteria [30,31]. Unlike microarrays, 
RNA-Seq allows transcription to be understood at the single- 
nucleotide level. Here, we used an RNA-Seq approach to compare 
the transcriptome of virulent low passage Leptospira interrogans sv. 
Copenhageni cultivated within DMCs with that of leptospires 
grown under standard in vitro conditions (30°C in EMJH). Using 
this approach, we determined the relative expression levels of 
"core" housekeeping genes under both growth conditions, and, 
more importandy, we identified 166 genes that are differentially- 
expressed by leptospires within the mammalian host, the majority 
of which are pathogen-specific [i.e., not present within saprophytic 
Leptospira). Most notably, our analyses highlight novel physiological 
aspects of mammalian-host adaptation by leptospires with respect 
to heme uptake and utilization. Moreover, we identified 1 1 novel 
non-coding (ncRNAs) transcripts which represent candidate small 
regulatory RNAs. In addition to providing a facile system for 
studying the transcriptional and physiologic changes leptospires 
undergo during mammalian infection, our data provide a rational 
basis for selecting new targets for mutagenesis. 

Results 

Virulent leptospires become mammalian host-adapted 
during growth within dialysis membrane chambers 

Our extensive experience with cultivation of Lyme disease 
spirochetes in DMCs implanted into rats [23,24,27,32], a natural 
reservoir for L. interrogans, led us to ask whether the DMC model 
could be used to generate mammalian host-adapted Leptospira. In 
preliminary experiments, we determined that virulent low-passage 
L. interrogans sv. Copenhageni strain Fiocruz Ll-130, diluted to low 
density (1x10* leptospires/ml) in EMJH medium, undergoes 
exponential replication within DMCs, reaching a maximal density 
of ~7xlO' leptospires/ml within 8 days post-implantation (data 
not shown). Importandy, leptospires recovered from DMCs 
explanted daily between 8 and 12 days post-implantation were 
vigorously motile by dark-field microscopy. The polypeptide 
profiles of leptospires in DMCs explanted between 9 and 12 days 
were highly similar (data not shown). On the basis of these studies, 
we chose 10 days as our standard period for intraperitoneal 
implantation. As shown in Figure lA, under these conditions, we 
noted numerous polypeptides whose expression was either 
increased or decreased in response to mammalian host-derived 
signals compared to in vitro-grown bacteria. The polypeptide 
differences between in vitro- and DMC-cultivated organisms were 
even more apparent by two-dimensional SDS-PAGE (Figure SI). 
While a comprehensive quantitative analysis of these differentially- 
expressed polypeptides is necessary to identify the corresponding 
leptospiral proteins, these data support our contention that 
virulent leptospires substantially alter their proteome in response 
to mammalian host-specific signals. 

With B. burgdorferi, successful mammalian host-adaptation within 
DMCs is determined by the reciprocal expression of the outer 
surface lipoprotein (Osp) A and OspC lipoproteins, which are 
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Figure 1. Virulent ieptospires become mammalian host-adapted during growth within dialysis membrane chambers. Representative 
whole cell lysates of Ieptospires cultivated to late-logarithmic phase in EMJH medium at 30°C in vitro (IV) and within dialysis membrane chambers 
(DMC) Implanted into the peritoneal cavities of female Sprague-Dawley rats. (A) Lysates were loaded according to the numbers of Ieptospires (5x10^ 
per lane) or total protein (5 ]ig per lane) and stained with SYPRO Ruby gel stain. Arrows and asterisks are used to highlight examples of polypeptides 
whose expression appears to be increased or decreased, respectively, within DMCs compared to in vitro. Molecular mass markers are indicated on the 
left. (B) Immunoblot analyses using rabbit polyclonal antisera directed against Sph2 [34], LipL32 [38] and LipL41 [39]. An arrow is used to indicate a 
band of the predicted molecular mass for SphH, a second, closely-related sphingomyelinase in L. interrogans recognized by antiserum directed 
against Sph2 [34,37]. 
doi:l 0.1 371/journal.ppat.1 004004.g001 



OFF and ON, respectively, within the mammal [23]. However, no 
expression profile associated with host-adapted L. interrogans has 
been reported and only a handful of leptospiral genes/proteins 
have been shown to be reproducibly upregulated during mam- 
malian infection. Among these is Sph2, one of four sphingomy- 
elinase-like proteins encoded by Z. interrogans sv. Copenhageni [12]. 
Although most strains of L. interrogans encodes at least 3 distinct 
sphingomyelinase-like proteins (Sphl, Sph2 and Sph3), only Sph2 
is thought to be a "true" (i.e., enzymatically active) sphingomy- 
elinase [33]. Expression of Sph2 is upregulated in vitro in response 
to serum [2 1] and/ or increased osmolarity [34] and during 
mammalian infection [35]. On the other hand, SphH, a closely- 
related pore-forming protein without sphingomyelinase activity 
[33,3(i], is expressed constitutively in vitro [34,37] and by 
Ieptospires colonizing the renal tubules of infected hamsters [37]. 
Consistent with these previous studies, the level of Sph2 was 
substantially higher in DMC-cultivated Ieptospires compared to in 
vitro-grown organisms, whereas SphH was expressed at similar 
levels under both conditions (Figure IB). Immunoblots using 
antisera against LipL32 and LipL41, two leptospiral lipoproteins 
expressed constitutively m vitro and during mammalian infection 
[38,39,40], were performed as loading controls (Figure IB). We 
considered these data as strong indication that DMC-cultivated 
Ieptospires are in a mammalian host-adapted state. 

RNA-Seq analysis of Leptospira cultivated in vitro and 
within PMCs 

Having established the feasibility of using DMCs to generate 
mammalian host-adapted L. interrogans, we compared the tran- 
scriptional profiles of DMC- and in I'ffro-cultivated Ieptospires by 
RNA-Seq. To ensure that our data would be robust and 
reproducible, we generated lUumina TruSeq libraries from three 
biologically-independent samples for each growth condition. The 
sequence statistics and numbers of mapped reads for each 
biological replicate are summarized in Table 1 and displayed 
graphically in Figure 2. The total number of reads ranged from 
~8-14 million per library, of which 79-94% of reads mapped to 



the L. interrogans sv. Copenhageni Fiocruz LI -130 reference 
genome [11,12]; only those reads that mapped to a single location 
on either Chromosome 1 or 2 were used to assess gene expression. 
The majority (43-55 "/o) of unique sequence reads mapped to 
protein-coding mRNAs annotated on Chromosome 1, while ~3— 
5% mapped to predicted ORFs on Chromosome 2; this 12:1 ratio 
is consistent with the relative coding capacities of the two 
chromosomes [11,12]. As discussed below, a considerable number 
of reads (13-20%) in both chromosomes mapped to non-coding 
regions that represent candidate small regulatory RNAs (sRNAs) 
(Table 1). 

RNA-Seq provides comprehensive coverage of the 
leptospiral transcriptome under in vivo and in vitro 
growth conditions 

The genome of Z. interrogans sv. Copenhageni Fiocruz LI -130 
harbors 3728 protein-encoding genes [11,12]. The vast majority 
(-94%) of these (3489 and 3499 in DMC- and in ariro-cultivated 
Ieptospires, respectively), were represented in our RNA-Seq data 
by a mean expression value of a I (Table S2). We observed 
average mean expression values of 67.2 and 60.5 per gene in 
DMC- and in ra'fro-cultivated organisms, respectively (data not 
shown). 

By comparative genomics, Picardeau et at. [14] identified 2052 
"core" protein-coding genes that are shared between pathogenic 
{L. interrogans and L. bor^etersenii) and saprophytic {L. biflexa) 
Leptospira species. Not surprisingly, many of these core gene 
products are associated with housekeeping functions, such as 
motility, energetics and intermediary metabolism, DNA and RNA 
metabolism, and cell division [14]. Analysis of the protein-coding 
sequences for the 100 most highly-expressed genes {i.e.. Top 100) 
in DMC-cultivated Ieptospires revealed that 66 are conserved {i.e., 
S40% amino acid identity over S80% of the coding region) 
between pathogenic and saprophytic Leptospira spp. and, therefore, 
part of the core group (Table S3); of note, the percentage (66%) of 
core genes within our Top 100 is similar to the overall percentage 
(55%) of core genes within the entire L. interrogans sv. Copenhageni 
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Figure 2. Mapping of RNA-Seq reads. Percentage of uniquely 
mapping reads from each biological replicate of leptospires cultivated 
in DMCs or under standard in vitro growth conditions (30'C in EMJH). 
doi:10.1371/journal.ppat.1004004.g002 
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genome [14]. Consistent with their proposed housekeeping 
functions, 62 (94%) of the 66 core genes within the Top 100 
were expressed at similar levels in vitro and within DMCs (Table 
S3). Thirty-four of the Top 100 genes are pathogen-specific {i.e., 
no orthologous gene identified in L. bijlexd), two of which 
{Lie 1 0465 /ligA and LIC12653) are found only in L. interrogans 
{i.e., absent in L. horgpetersensii, L. licerasiae and L. santarosai). Eight of 
the 34 pathogen-specific genes within the Top 100 were 
upregulated by L. interrogans sv. Copenhageni within DMCs (see 
Table S3 and below). 

We also surveyed both DMC- and in oifro-derived datasets for 
genes associated with key metabolic pathways. One unusual 
metabolic feature of pathogenic leptospires, compared to other 
spirochetes, is that they are unable to utilize glucose despite 
encoding a seemingly complete glycolytic pathway, relying instead 
on P-oxidation of long-chain fatty acids as sources of both carbon 
and energy [11,41]. By RNA-Seq, we detected uniquely mapped 
reads for all of the genes thought to be involved in glucose uptake 
and utilization (KEGG pathway licOOOlO), each of which was 
expressed at similar levels in DMCs and in vitro (Table S4). 
However, two genes, LIC13358 and LIC20119, both encoding 
putative phosphoglucomutases, and LIC12908, encoding the only 
glucose transporter identified in L. interrogans [11,12,42], were 
expressed at extremely low levels, both in DMCs and in vitro (Table 
S2). These data support the findings of Zhang et al. [42], who 
proposed that the inability of pathogenic leptospires to utilize 
glucose stems from insufficient glucose uptake and/or catalysis 
rather than an incomplete glycolytic pathway. As one might 
predict, we detected significant numbers of sequence reads for 
genes involved in the uptake and P-oxidation of medium and long- 
chain fatty acids (KEGG pathway lic00071), the citric acid cycle 
(KEGG hc00020), generation of NAD/NADP (KEGG hc00760), 
and oxidative phosphorylation (KEGG lic00190). AH of the 
individual genes involved in these energetic pathways were 
expressed at similar levels under both growth conditions (Table S4). 

Genes whose expression was significantly upregulated 
by leptospires in DMCs compared to in wfro-grown 
bacteria 

Using DESeq [43], we identified 166 genes whose expression 
was either positively- or negatively-regxilated by & 2-fold (adjusted 
p-value£0.05) within the mammal (Tables 2 and 3). Although 
some variance was observed between biological replicates (Table 
S2), a heat map representing the expression data for all 166 
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Table 2. L. interrogans sv. Copenhageni genes upregulated In DMCs compared to in vitro. 



Gene ID^ 




Product^ 


Fold (DMC vs IV) 


P value (adjusted) 


Hypothetical Proteins 


Ua0822 




Hypothetical protein 


ND @ 30°C 


4.00E-02 


LIC12077 




Hypothetical protein 


22.78 


1.19E-18 


UC13005 




Hypothetical protein 


21.12 


2.75E-04 


Lia3445 




Hypothetical protein 


17.46 


6.13E-06 


LIC11059 




Hypothetical protein 


16.76 


2.65E-13 


Ua0456 




Hypothetical protein 


10.59 


4.12E-02 


UCW965 




Hypothetical protein 


9.73 


8.94E-08 


LIC12340 




Hypothetical protein 


7.14 


3.40E-10 


LIC13390 




Hypothetical protein 


6.60 


2.74E-04 


LIC10455 




Hypothetical protein 


6.54 


1 .04E-02 


UCUUO 




Hypothetical protein 


6.20 


1 .93E-02 


LIC10376 




Hypothetical protein 


6.14 


2.07E-02 


LICl 1888 




Hypothetical protein 


5.71 


1 .33E-08 


LIC10535 




Hypothetical protein 


5.61 


2.65 E-02 


LICl 2555 




Hypothetical protein 


4.76 


5.29E-04 


LICl 0971 




Hypothetical protein 


4.35 


8.70E-03 


LICl 0775 




Hypothetical protein 


4.27 


8.69E-07 


LIC10790 




Hypothetical protein 


4.15 


8.98E-03 


LICl 1695 




Hypothetical protein 


4.06 


9.65E-03 


LICl 1492 




Hypothetical protein 


4.03 


4.56E-02 


LIC12986 




Hypothetical protein 


4.03 


2.53E-02 


UC10415 




Hypothetical protein 


3.93 


5.73E-06 


LICl 3354 




Hypothetical protein 


3.77 


4.53E-03 


LIC12653 




Hypothetical protein 


3.63 


5.83E-03 


LICl 2993 




Hypothetical protein 


3.53 


1.19E-03 


LICl 0593 




Hypothetical protein 


3.32 


3.07E-03 


LICl 1705 




Hypothetical protein 


3.14 


3.55E-02 


LICl 0080 




Hypothetical protein 


3.14 


5.99E-03 


LICl 0374 




Hypothetical protein 


3.09 


2.12E-02 


UC10729 




Hypothetical protein 


2.97 


2.12E-02 


LICl 0450 




Hypothetical protein 


2.94 


1.21 E-02 


LIC10454 




Hypothetical protein 


2.86 


4.53E-02 


LICl 1783 




Hypothetical protein 


2.82 


4.12E-02 


LICl 0053 




Hypothetical protein 


2.81 


1.23 E-02 


LICl 0460 




Hypothetical protein 


2.74 


5.31 E-03 


LICl 3084 




Hypothetical protein 


2.72 


2.03 E-02 


LICl 2263 


ompL37 


Hypothetical protein 


2.67 


8.81 E-03 


LIC20245 




Hypothetical protein 


2.66 


4.04E-02 


LIC12719 




Hypothetical protein 


2.66 


2.24E-02 


LICl 1275 




Hypothetical protein 


2.66 


3.36E-02 


LICl 321 2 




Hypothetical protein 


2.61 


3.75E-02 


LICl 2353 




Hypothetical protein 


2.53 


4.28E-02 


LICl 1565 




Hypothetical protein 


2.52 


2.11 E-02 


LICl 0050 




Hypothetical protein with OmpA-like domain 


2.46 


1 .54E-02 


LICl 11 77 




Hypothetical protein 


2.43 


2. 67 E-02 


LICl 1447 




Hypothetical protein 


2.42 


3.36E-02 


LICl 0302 




Hypothetical protein 


2.23 


2.28E-02 


LICl 2071 




Hypothetical protein 


2.08 


3. 02 E-02 
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Gene ID^ 




Product^ 


Fold (DMC vs IVI 


P value (adjusted) 


Putative or Probable Lipoproteins (unlfr^own function) 


UO2099 


lipL53 


Lipoprotein 


19.50 


1 .03E-26 


Lia 1030 




Lipoprotein 


11.18 


1 .78E-04 


1102209 




Lipoprotein 


7.76 


2.75E-15 


LICl 1058 


lemA 


Lipoprotein 


6.27 


8.69E-07 


LIC10373 




Lipoprotein 


5.37 


4.61 E-11 


1103066 




Lipoprotein 


5.34 


3.09E-09 


UO0462 




Lipoprotein^ 


5.08 


3.17E-07 


LIO0371 




Lipoprotein 


4.61 


9.64E-08 


UO0054 


rlpA 


Lipoprotein 


3.84 


1 .44E-05 


LI03355 




Lipoprotein 


3.72 


1 .25E-02 


LIO2208 




Lipoprotein 


3.64 


2.07E-04 


UO0461 




Lipoprotein 


3.54 


1 .84E-04 


UO0968 




Lipoprotein^ 


3.44 


2.81 E-03 


LIO0463 




Lipoprotein 


3.31 


4.81 E-04 


LIO 1082 




Lipoprotein 


3.15 


2.03 E-03 


L101167 




Lipoprotein 


2.24 


2.03E-02 


Patfiogenicity and/or Virulence 


UO2760 


colA 


Collagenase precursor 


49.03 


2.63E-51 


LIO 2631 


sph2 


Hemolysin/sphlngomyellnase-lil<e protein, Sph2 


13.92 


5.13E-26 


LIO 1219 


ahpC 


Peroxiredoxin 


5.96 


7.29E-07 


LIO 0465 


ligA 


Leptospira lg-lil<e protein LigA^ 


4.57 


2.03E-02 


LIO 2632 


sphl 


Hemolysin/sphingomyelinase-lil<e protein, Sphl 


4.02 


1.52E-05 


LIO 2927 




Cytochrome c peroxidase^ 


4.23 


1.04E-05 


LI03198 


sph3 


Hemolysin/sphlngomyelinase-lil<e protein, Sph3 


3.57 


1 .99E-02 


LI02659 


vapB 


Virulence-associated protein 


2.90 


2.66E-02 


Chemotaxis and l\/lotility 


LIO 0299 


fIgB 


Flagellar basal body rod protein FIgB 


7.89 


4.89E-16 


LIO 1889 


flaB 


Flagellin protein 


3.95 


1 .50E-02 


LIO 0298 


fIgC 


Flagellar basal body rod protein FIgC 


3.91 


2.12E-05 


LIO 1328 


figj 


Flagellum-specific muramidase 


3.89 


7.28E-03 


LIO 0297 


fUE 


Flagellar hook-basal body protein FliE 


3.16 


1.87E-03 


LIO 1326 


fIgH 


Flagellar L-ring protein precursor 


2.66 


3.03E-02 


LIO 1186 


fIbC 


Flagellar protein 


2.37 


3.72E-02 


Regulatory Functions 


LI02798 




TetR family transcriptional regulator 


5.64 


3.65E-05 


LIO 1146 




DeoR family transcriptional regulator 


4.42 


3.08E-03 


LIO2034 


fur 


Fur family transcriptional regulator 


3.83 


7.84E-04 


LIO 1440 




Histidine kinase response regulator hybrid protein 


3.36 


1 .04E-03 


LIO 0996 




EAL-type diguanylate phosphodiesterase 


3.23 


2.11 E-03 


LIO 1617 




ArsR family transcriptional regulator 


3.11 


7.39E-03 


LIC20025 




Cyclic nucleotide binding protein 


2.41 


4.73E-02 


Intermediary Metabolism and Biosynthesis of Small Molecules 


LIO 3053 


desA 


Fatty acid desaturase 


6.72 


2.98E-1 1 


LI02981 




Glutathione S-transferase 


4.00 


2.59E-02 


LIO 3397 


phoD 


Alkaline phosphatase 


3.21 


6.13E-04 


LIO 3085 


coaE 


Dephospho-CoA kinase 


3.20 


1 .79E-02 


LIC20148 


hoi 


Heme oxygenase 


3.06 


3.35E-03 


LIO 2322 




Glutaconate CoA transferase-like protein 


2.32 


4.80E-02 


LIO 3031 




Aminotransferase 


2.25 


1 .46E-02 
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Table 2. Cont. 





Gene ID^ 




Product^ 


Fold (DMC vs IVI 


P value (adjusted) 


UC1346S 




Gly tRNA 


4.69 


1 .69E-02 


Cell Processes/Transport 


Ua 1694 




TonB-dependent outer membrane receptor 


14.85 


2.26E-13 


Lia0964 


phuR 


TonB-dependent outer membrane hemin receptor 


3.46 


2.23E-02 


LIC20M9 




Multidrug-efflux transporter 


2.82 


4.22E-02 


LIC12992 


sulP 


Sulfate permease 


2.81 


4.05 E-03 


DNA Metabolism and Cell Division 


UCU467 


reel 


Regulator of chromosome condensation 


7.80 


1.92E-03 


UC12737 




Site-specific modification DNA-methyltransferase 


5.12 


6.37E-1 1 


UC12297 




DNA repair protein 


4.55 


3.80E-02 


UC10131 


mesJ 


Cell cycle protein 


3.41 


3.62E-04 


LIC10252 




Exonuclease 


3.31 


5.86E-03 


UC13389 


mutS 


DNA mismatch repair protein 


2.56 


3.80E-02 


ua 1479 


xerD 


Integrase/recombinase protein 


2.91 


3. 02 E-03 


Macromolecular Metabolism and Cell Structure 


UC10537 




OmpA-like peptidoglycan-associated periplasmic protein 


3.79 


1 .OOE-03 


LICW528 


pbpB 


Peniclllln-blnding protein 3 


2.46 


1 .99E-02 


Miscellaneous 


LIC10149 


frnE 


Polyketlde synthase 


4.89 


1 .08E-02 


UCW251 




RadSO-like protein 


3.22 


4.05E-03 


ua 1850 


rmsE 


16S ribosomal RNA methyltransferase RsmE 


2.41 


8.14E-03 



^Gene designations and protein product descriptions are based on those of [1 1,12] and the L. interrogans sv. Copenhageni Genome Project database (http://aeg.lbl.ic. 
unicamp.br/world/lic/), except where indicated. 
Annotation based on Setubal ef al. [47]. 
^Revised annotation based on biolnformatics. 
doi:l 0.1 371 /journal.ppat.1 004004.1002 



difFerentially-expressed genes confirmed that each biological 
replicate clustered with its respective sample source (Figure S2). 
Of the 110 genes upregulated by L. interrogans within DMCs, 106 
are on Chromosome 1 while only 4 are on Chromosome 2 
(Table 2). All but 3 of the upregulated genes appear to be 
pathogen-specific {i.e., a paralogous gene/protein could not be 
identified in L. biflexa; 54 of these are unique to L. interrogans and an 
additional 7 are unique to serovar Copenhageni (Figure 3). Almost 
half (49/110) of the genes upregulated in DMCs encode 
hypothetical proteins (Figure 4 and Table 3), which is consistent 
with the overall percentage (40%) of hypothetical genes annotated 
within L. interrogans [6,1 1]. Based on searches performed using the 
Conserved Domain Database [44,45], none of the hypothetical 



proteins encoded by these genes contained readily identifiable 
functional domains (data not shown). However, one gene {LIC12986} 
recently was shown to be required for leptospires to survive within 
hamsters and to coloniie the renal tubules of mice [46] . 

Of the remaining upregulated genes, 16 encode putative 
lipoproteins of unknown function [47] (Figure 4 and Table 3). 
Surface-exposed spirochetal lipoproteins have been implicated in a 
wide range of pathogenesis-related functions, including adherence 
to extracellular matrix components and nutrient acquisition [48]. 
However, because the mechanism(s) responsible for sorting 
individual spirochetal lipoproteins remain poorly understood, it 
is not possible to predict based on amino acid sequence alone 
which, if any, might function at the pathogen-host interface. 



Table 3. Leptospiral genes differentially-expressed within DMCs compared to in vitro. 





Genes 


No of genes in each category 








Upregulated (%)^ 


Downregulated (%)^ 


Total (%)^ 


Known or predicted function 


45 (41%) 


20 (36%) 


66 (40%) 


Unknown or poorly characterized 
function^ 


65 (59%) 


36 (64%) 


101 (60%) 


Total 


110 


56 


166 



^Percentage of genes based on the total number of genes in upregulated or downregulated category. 
^Percentage of genes based on the total number of differentially-expressed genes. 
^Hypothetical proteins and uncharacterized lipoproteins. 
doi:1 0.1 371/journal.ppat.1 004004.1003 
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Miscellaneous 

Macromolecular Metabolism and Cell Struc. 
DNA Metabolism and Cell Division 
Cell Processes/Transport 
Inter. Metabolism and Biosyn. of Sm. Molec. 

Regulatory Functions- 
Chemotaxis and Motility- 
Pathogenicity and/or Virulence 
Putative or Probable Lipoproteins - 
Hypothetical protein 



Ups 
Down 



— I — 

20 



~40 



'so 



100 



# Genes per Category 



Figure 3. Conservation of L interrogans sv. Copenhageni Fiocruz L1-130 differentially-expressed genes among virulent and 
saprophytic Leptospira spp. Protein sequence similarities were determined using GLSEARCH (v. 34.05). Genomes used for analysis: L 
interrogans sv. Lai strain 56601, L. borgpetersenii sv. Hardjo strain L550, L. santarosai sv. Shermani strain LT821; L. licerasiae sv. Varillal strain VAR010; 
and L biflexa sv. Patoc strain Patoci Ames, respectively. The color coding used in the heat map is as follows: blue, 95-100% identity; green, 90-94% 
identity; orange, 85-89%; and yellow, 80-84%. 
doi:10.1371/journal.ppat.1004004.g003 



Virulence-associated genes. Eight DMC-upregulated genes 
encode proteins implicated in pathogenicity and/or virulence. 
LIC12760/ colA, the most significantly upregulated gene (49-fold) in 
our studies, encodes a coUagenase precursor. Degradation of host 
tissues by this enzyme is thought to promote bacterial colonization 
and/or dissemination as well as provide an additional source of 
nutrients {e.g., amino acids) [49]. LIC12631, UC12632 and 
LIC13198, respectively, encode Sph2, Sphl and Sph3. Lysis of host 
erythrocytes by Sph2 may enhance acquisition of fatty acids and 
heme/iron from the host. Narayanavari et al. [33] also raised the 
possibility that the non-catalytic Sphs (Sphl and Sph3) function as 
adhesins via their interaction with host sphingomyelin. LIC 10465 
encodes leptospiral immunoglobulin-like (Lig) protein A; this 



multifunctional, outer membrane-associated lipoprotein has been 
shown to promote binding to host molecules, including fibronec- 
tin, fibrinogen and extracellular matrix [50,51,52]. Moreover, 
antibodies against LigA are protective in a hamster model of 
acute infection [53]. LIC12659/vapB encodes a putative viru- 
lence-associated protein with similarity to the AbrB-like family of 
transcriptional regulators [54]; ArbB-like transcription factors, 
also referred to as transition state regulator proteins, have been 
identified in diverse bacteria but only orthologs from Bacillus have 
been characterized with respect to function and DNA-binding 
capabilities [55,56]. LIC11219 and LIC12927, encoding a 
peroxiredoxin (AhpC) and cytochrome c peroxidase, respectively, 
are discussed below. 



Upregulated genes 

L. biflexa 



L. licerasiae 
L. borgpetersenii 
L santarosai 
L. inter, sv. Lai 




-L-dJ JlL .L JJJJI 

iii|iii«»i|H|iiiiiiiiiiiiiii 

Downregulated genes so ss 90 95 100 

/_ ^jfie^g % Sequence identity 

L. licerasiae 



L. borgpetersenii 
L. santarosai 
L. inter, sv. Lai 




iliiiilliiilHiliillliiliili 



Figure 4. Functional categories of genes differentially-expressed by L interrogans sv Copenhageni strain Fiocruz L1-130 within 
DMCs. Functional categories are based on those of [1 1,12] and the Leptospira interrogans sv. Copenhageni Genome Project database (http://aeg.lbi. 
ic.unicamp.br/world/lic/). The number of upregulated (Dps) and downregulated (Down) genes within each category are indicated in red and blue, 
respectively. 

doi:10.1371/journal.ppat.1004004.g004 
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Motility-related genes. Consistent with the highly invasive 
nature of leptospiral infection, seven motility-related genes were 
upregulated within DMCs (Table 2), including three {LIC 10299/ 
JlgB, UCl 0298 /JlgC and LIC10297 /JliE) involved in flagellar basal 
body formation (Figure 5). The L. interrogans genome contains five 
copies oiJlaB {UC11889, LIC11890, LIC11531, 11011532 and 
LIC12947), which encode the flagellar core subunit flagellin. Of 
these, only LIC11889 was differentially-expressed within DMCs. 
Interestingly, based on the number of uniquely mapped reads 
determined by DESeq, LIC12947 was expressed at substantially 
lower levels than the other four JlaB paralogs under both growth 
conditions (Table S2), suggesting that this gene product may not 
contribute significantly to the formation of flagella in vitro or in vivo. 

Uptake and utilization of iron/heme. Unlike B. burgdorferi 
[57], L. interrogans requires iron for growth in vitro and, presumably, 
within the mammalian host. In EMJH medium, leptospires obtain 
iron from Fe(II) sulphate, while organisms in the mammal acquire 
iron from heme and/ or heme-containing proteins [58] . Heme 
(free or complexed with hemoglobin) is appropriated from the host 
by high-aflinity TonB-dependent outer membrane receptor (TB- 
DR) proteins. Based on bioinformatic analysis, L. interrogans 
encodes at least 13 putative TB-DRs [58], however, only two 
{UCl 0964 and LIC11694) were upregulated within DMCs 
(Table 3 and Table S4). While most often associated with iron 
uptake, TB-DRs also may bind vitamin B 1 2, a nutrient essential for 
leptospires in vitro and, presumably, in vivo [59]. Only one TB-DR 
in L. interrogans sv. Copenhageni {LIC12374/ htuB) is annotated as 
being specific for vitamin B12, and the gene encoding this 
transporter component was not differentially-expressed in DMCs 
compared to in vitro (Table S4). Transport of heme and/ or iron 
across the outer membrane requires energy produced by an inner 
membrane complex of the energy transduction protein TonB and 
two accessory proteins, ExbB and ExbD [59] . L. interrogans encodes 
at least two TonB-ExbB-ExbD complexes, arranged in separate 
operons, one on each chromosome. Interestingly, the transporter 
on Chromosome 2 {LIC202 16-202 18) was expressed at much 
higher levels (> 14-fold) than its counterpart on Chromosome 1 
{LICl 0889-1 0892) under both growth conditions. Neither operon, 
however, was differentially-expressed in DMCs. 

Consistent with an increased requirement for iron and/ or heme 
in vivo, we detected increased expression (3.27-fold) of heme 
oxygenase {LIC20148/hol) [60] within DMCs. Once released, iron 



would be stored in the cytoplasm by bacterioferritin (LICl 1310) 
and/or ferredoxin (LIC'13258 and LIC13209) gene products, all of 
which were well expressed by leptospires in vitro and in DMCS 
(Table S5). 

When in excess, iron can lead to toxicity via die production of 
reactive oxygen species (ROS). As such, bacterial genes associated 
with iron homeostasis often are regulated by the ferric uptake 
regulator protein Fur, a global iron-responsive transcriptional 
repressor [61]. L. interrogans encodes at least four putative Fur 
paralogs [LIC11006, LIC11158, LIC12034 and L1C20147). We 
used SLiMSearch [62] to survey the L. interrogans genome for "fur 
boxes "( [GC] AT [AT] AT [GC] AT [AT] AT [GC] AT [AT] AT [GC] ) 
[61], and were unable to identify any obvious Fur-regulated genes 
(data not shown). Fur proteins, including those encoded by 
Leptospira spp. [58], share significant sequence similarity with 
orthologs for Zur, a zinc uptake regulator, and Per, an oxidative 
stress response regulator [63]. Based on bioinformatics and/or 
experimental evidence, two of Furs identified in L. interrogans sv. 
Copenhageni {LICl 2034 and LIC20147) appear to encode Per 
orthologs [22,58]. One of these {LICl 2034) was upregulated in 
DMCs (Table 2), suggesting that leptospires within DMCs are 
under some degree of oxidative stress. 

Oxidative and thermal stress-related genes. Leptospires 
must cope with numerous stressors within the host, most notably, 
oxidative stress. Incomplete reduction of oxygen by iron-contain- 
ing cytochromes is one potential source of endogenous ROS [64]. 
Leptospires likely encounter exogenously-derived ROS within the 
proximal renal tubules, a highly oxygenated tissue niche. Not 
surprisingly, L. interrogans encodes a more diverse repertoire of 
antioxidant proteins than either Treponema pallidum or B. burgdorferi 
(Table S5). Although L. interrogans encodes a functional catalase 
[65] , it lacks superoxide dismutase (the enzyme typically associated 
with detoxification of 02' ) and the regulatory proteins OxyR and 
SoxR. Interestingly, only two oxidative stress-associated genes 
{LIC12927 and LIC11219) were upregulated within DMCs 
(Table 2). The former encodes a cytochrome c peroxidase while 
the latter encodes an AhpC-type peroxiredoxin. In E. coli, AhpC 
scavenges basal levels of endogenous peroxide generated as a 
metabolic by-product [66]. Increased expression of AhpC within 
DMCs is consistent with increased uptake of exogenously-derived 
heme (see above) and increased potential for Fenton chemistry 
within the cytoplasm. Like T. pallidum [67], L. interrogans does not 
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Figure 5. IGB viewer of normalized gene expression data for the flagellar genes fliE, fIgB and f/gC. Visualization of normalized mapped 
reads for minus (-) strand of an operon encoding genes fliE, fIgB and fIgC of the flagellar proximal rod shows increased expression by leptospires 
cultivated in dialysis membrane chambers (DMC, green) compared to those cultivated in vitro (IV, red). Annotated genes on Chromosome 1 are in 
blue. The vertical "read count" scale is 0-50. 
doi:10.1371/journal.ppat.1004004.g005 
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encode an AhpF, the usual reducing partner for AhpC, and most 
likely uses thioredoxin/ thioredoxin reductase and/ or glutaredoxin 
for this purpose, all of which were well expressed by DMC- 
cultivated leptospires (Table S5). 

In addition to oxidative stress, increased temperature within the 
host might induce a stress response by leptospires in vivo [18,19]. 
However, consistent with previous reports [19,68,69], none of the 
classical heat shock response genes encoded by L. interrogans were 
upregulated within DMCs, compared to in vitro growth at 30°C 
(Table S4). 

Regulators of transcription. The leptospiral genome en- 
codes >200 gene products with the potential to directly regulate 
transcription (Figure 3), including numerous two-component 
sensor histidine kinases (HKs) and/or response regulators (RRs), 
alternate sigma factors, sigma factor regulators, anti-sigma factor 
antagonists, and trans-acting factors [11,12]. Only a few of these 
were upregulated within DMCs. One [LIC11440) encodes a hybrid 
sensor kinase/ response regulator (HK/RR) protein; the sensor for 
this HK/RR contains a PAS-type sensor domain, which typically 
recognize small molecules, including heme [70] . Three additional 
genes {UC12798, UC11146 and UC11617) were DMC-upregu- 
lated and encode putative transcriptional regulators belonging to 
the TetR [71], DeoR [72], and ArsC [73] families of repressor 
proteins. 

Genes whose expression was significantly 
downregulated in DMCs compared to in vitro 

By RNA-Seq, we identified 56 genes (47 on Chromosome 1 and 
9 on Chromosome 2) that were downregulated in DMCs (Tables 3 
and 4). AH of the downregulated genes are pathogen-specific (i.e., 
not found in L. biflexd); almost half (26/56) are unique to L. 
interrogans (i.e., not in L. borgpetersenii, L. santarosai or L. licerasiae) 
(Figure 3). As with the upregulated gene subset, more than half 
(35/56) of the DMC-downregulated genes encode hypothetical 
proteins (Figure 4 and Table 3); of note, almost half (43%) of these 
appear to be transcribed in two polycistronic operons (LIC10173- 
10177 and UCl 2604-1 261 6). Interestingly, all of the genes within 
these two putative operons are pathogen-specific. Only one 
lipoprotein (LIC20153) was expressed at lower levels in DMCs 
(compared to 16 upregulated). 

Five genes related to de novo heme biosynthesis (LIC20008 / hemA, 
UC20009/hemCD, UC20010/ hemB, LIC20011 /heniL and 
IIC20014/hemE] [74] were DMC-downregulated these fmdings 
imply that leptospires can scavenge heme from the mammalian 
host. The heme biosynthetic operon also contains genes encoding 
a two component system (TCS). Signal transduction by the 
orthologous TCS in L. bijlexa is required for regulation of heme 
biosynthesis [75]. Although both the histidine kinase (HK; 
UC20012) and the response regulator (RR; UC20013) were 
downregulated (2.50- and 2.22-fold; respectively) in DMCs, the 
fold-change for the RR was not significant (p = 0.097). Based on 
their tandem arrangement and similar expression profiles, these 
heme biosynthetic genes appear to be transcribed as a single 
operon. LIC2001 7 /hemG and LIC20018/hemII, encoding enzymes 
responsible for the last two steps in heme biosynthesis, respectively, 
are located downstream of the larger biosynthetic operon; both of 
these genes appear to be transcribed as monocistronic messages at 
similar levels in vitro and in DMCs (Table 4 and data not shown). 

Identification of novel candidate small RNAs 

One of the advantages of RNA-Seq is that it allows visualization 
of uniquely mapped reads within non-annotated regions of the 
genome. Using the IGB browser, we detected at least 1 1 regions 
that were transcriptionaUy-active but not protein coding; these 



non-coding RNA (ncRNA) transcripts are novel candidate small 
regulatory RNAs (sRNAs) within L. interrogans (Table 5 and Figure 
S3). Five of these are homologous to known sRNA families 

(tmRNA, RNaseP, PyrR binding site and two cobalamin sRNAs) 
(http://rfam.sanger.ac.uk/) [76,77,78,79]. The expression of 8 of 
the 1 1 putative sRNAs was validated by reverse-transcriptase PCR 
in L interrogans sv. Copenhageni strain RJ 16441 (Table 5) and all 
predicted sRNAs were highly conserved in the closely-related 
virulent serovar type, strain Lai [80]. One of the predicted sRNAs, 
LIClncSO (Figure 6), was significantly DMC-upregulated (4.39- 
fold) compared to in ariro-cultivated leptospires (Table S2). Further 
characterization of these candidate sRNAs (i.e., by Northern blot) 
is required to understand their function(s) and relationships to the 
surrounding genes (i.e., 5' UTR verses bone fide sRNA). 

Validation of RNA-Seq data by quantitative RT-PCR 

To validate our RNA-Seq data, we performed quantitative 
reverse transcription-PCR (qRT-PCR) on a panel of 14 genes that 
were, according to DESeq analysis, upregulated (LIC12631/sph2, 
UC11888 and UC11889ljlaB), downregulated (LIC10175, 
LIC10179 and UCA261S), or unchanged (LIC10191 /loa22, 
UC12966/lipL41, LIC13166/ompL36, LIC10787/flaA'2, 
UCl 0068, LIC10421, LIC12339, and LIC20001) in DMCs 
compared to in OTfro. While there is some debate regarding the 
most appropriate leptospiral gene to use for normalization [21,81], 
we selected LIC11352/lipL32 based on studies demonstrating that 
its expression was relatively unchanged under a wide-range of 
growth conditions, including increased temperature, increased 
osmolarity, and/or exposure to serum [21,38,68]. Representative 
results are shown in Figure 7A; data for the entire panel are 
presented in Figure S4. Overall, we saw strong agreement between 
our RNA-Seq and qRT-PCR datasets; the correlation coefficient 
(R^) between RNA-Serj and qRT-PCR data across the entire 
panel was 0.8881 (Figure 7B). We also used qRT-PCR to confirm 
the relative expression for two (LIClnc60/KNzse P and LIC2nclO/ 
cobalamin) of the putative sRNAs (Figure S4); of these, only 
Z/CincffO/RNaseP was upregulated (2.65-fold; p = 0.0054) widiin 
DMCs. 

Discussion 

The identification of genes/proteins that are differentially- 
expressed by microorganisms only during infection and/ or within 
specific host niches often provides insight into the parasitic 
strategies of pathogens. During natural and experimental infection 
in rats, L. interrogans rapidly disseminate hematogenously to all 
tissues but are cleared by 7 days post-inoculation from sdl sites 
except the kidneys [7,82]. The ability of leptospires to colonize and 
persist within renal tubules almost certainly involves unique 
virulent determinants [1]; however, the paucUbacillary nature of 
leptospiral infection, even within this preferred niche, hinders our 
ability to perform global gene expression studies on L. interrogans 
within host tissues. Prior studies, including several using micro- 
array-based approaches [18,19,21,22,83], have manipulated in 
vitro growth conditions to simulate the environmental signals 
encountered by leptospires within the mammal. Based on 
extensive studies with B. burgdorfeii, another pathogenic spirochete, 
we and others have demonstrated that bone fide mammalian host 
adaptation is a complex and dynamic process that cannot be fully 
reproduced ex vivo [23,27,32]. We therefore used a rat peritoneal 
dialysis membrane chamber (DMC) model to generate sulficient L. 
interrogans in a mammalian host-adapted state to perform global 
transcriptional studies. Cultivation of leptospires within DMCs, in 
conjunction with next generation sequencing, enabled us to define 
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Table 4. L. interrogans sv. Copenhageni genes downregulated In DMCs compared to in vitro. 



Gene ID^ 




Product' 


Fold (DMC vs IV) 


P value (adjusted) 


Hypothetical Proteins 


uaoi73 




Hypothetical protein 


-31.35 


2.28E-09 


LIC10174 




Hypothetical protein 


-29.41 


4.12E-02 


Lia0176 




Hypothetical protein 


- 1 6.09 


1 .52E-05 


LIC10175 




Hypothetical protein 


-12.71 


4.24E-15 


ucueii 




Hypothetical protein 


- 1 1 .09 


2.02E-1 1 


UC12616 




Hypothetical protein 


-9.22 


3.54E-03 


UC12615 




Hypothetical protein 


-8.46 


2.01 E-1 2 


LIC12614 




Hypothetical protein 


-8.13 


1 .59E-05 


LIC12610 




Hypothetical protein 


-6.99 


2.07E-06 


LIC10179 




Hypothetical protein 


-6.81 


5.27E-03 


LIC13417 




Hypothetical protein 


-6.59 


2.38E-08 


LICl 1893 




Hypothetical protein 


-6.21 


4.89E-04 


LIC12612 




Hypothetical protein 


-6.19 


2.50E-03 


LICl 3247 




Hypothetical protein 


-6.01 


1 .79E-02 


UC10177 




Hypothetical protein 


-6.01 


4.1 1 E-06 


LIC12608 




Hypothetical protein 


-5.54 


8.50E-04 


LIC12613 




Hypothetical protein 


-5.46 


1 .09E-05 


LIC13416 




Hypothetical protein 


-5.04 


5.73E-06 


LICl 2604 




Hypothetical protein 


-4.94 


2.02E-03 


LIC10189 




Hypothetical protein 


-4.83 


9.23E-03 


LICl 2692 




Hypothetical protein 


-4.76 


2.40E-05 


LICl 2338 




Hypothetical protein 


-4.75 


3.31 E-02 


LICl 0880 




Hypothetical protein 


-4.49 


3.89E-02 


LIC10186 




Hypothetical protein 


-4.27 


1.17E-02 


LIC12430 




Hypothetical protein 


-3.51 


1 .99E-02 


LIC10187 




Hypothetical protein 


-3.38 


2.03E-02 


LICl 2693 




Hypothetical protein 


-3.36 


2.65E-03 


LIC13012 




Hypothetical protein 


-3.34 


8.21 E-03 


LICl 2678 




Hypothetical protein 


-3.30 


4.75E-02 


LICl 2607 




Hypothetical protein 


-3.21 


3.49E-02 


LIC12605 




Hypothetical protein 


-3.14 


8.38E-03 


LICIOISI 




Hypothetical protein 


-3.13 


4.52E-02 


LICl 2676 




Hypothetical protein 


-2.87 


1 .49E-02 


LICl 3086 




Hypothetical protein^ 


-2.77 


2.1 1 E-03 


LICl 0663 




Hypothetical protein 


-2.65 


1 .69E-02 


Putative or Probable Lipoproteins (unknown function) 


LIC20153 




Lipoprotein^ 


-2.90 


2.27E-02 


Regulatory Functions 


LICl 3087 




Histidine kinase sensor protein 


-4.55 


2.15E-02 


LICl 2431 




TetR family transcriptional regulator 


-2.87 


1 .49E-02 


LIC20012 




Histidine kinase sensor protein 


-2.52 


4.03E-02 


Cellular Processes/Transport 


LICl 2428 


phnL 


ABC transporter ATP-binding protein 


-7.96 


2. 03 E-02 


Intermediary Metabolism and Biosynthesis of Small Molecules 


LIC20008 


hemA 


Glutamyl-tRNA reductase 


-6.31 


2.78E-09 


LICl 1758 




Acyl-CoA hydrolase 


-5.56 


2.23E-02 


LIC12772 


proB 


Gamma-glutamyl kinase 


-3.97 


2.53E-02 


LIC20009 


hemC 


Porphobilinogen deaminase 


-3.74 


1 .82E-04 
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Table 4. Cont. 



Gene ID^ 




Product^ 


Fold (DMC vs IV) 


P value (adjusted) 


UCW409 


teuA 


2-isopropylm3l3tG synth3S6 


— 3.34 


1 .93E-03 


LIC20014 


hemE 


Uroporphyrinogen decarboxylase 


-3.33 


3.41 E-03 


LIC1056S 


hbdl 


Enoyl-CoA hydratase 


-3.28 


2.26E-02 


LIC20238 


speH 


S-adenosylmethionine decarboxylase like protein 


-3.28 


1.19E-03 


LIC10403 


ribH 


6,7-dimethyl-8-ribityllumazine synthase 


-2.86 


3.06E-02 


LIC2001 1 


hemL 


Glutamate-1-semialdehyde aminotransferase 


-2.44 


1 .49E-02 


LIC20239 


speD 


S-adenosylmethionine decarboxylase proenzyme 


-2.31 


1 .99E-02 


LIC20010 


hemB 


Delta-aminolevulinic acid dehydratase 


-2.29 


3.57E-02 


Macromolecular Metabolism and Cell Structure 


UC12017 


cIpB 


ATP-dependent protease 


-4.71 


1 .1 1 E-02 


Lia2360 


pirin 


Pirin 


-2.83 


6.60E-03 


Lia23S7 


fusA 


Elongation factor EF-G 


-2.71 


3.53E-03 


LIC11911 




Glycosyltransferase 


-2.65 


4.04E-02 



'Gene designations and protein product descriptions are based on those of [1 1,12] and the L interrogans sv. Copenhageni Genome Project database (http://aeg.lbi.ic. 
unicamp.br/world/lic/) except where indicated. 
^Annotation based on Setubal ef al. [47]. 
doi:l 0.1 371 /journal.ppat.l 004004.1004 



for the first time the transcriptome of L. interrogans within the 
mammalian liost. 

In order to transition from a free living to infectious state, 
leptospires must adjust their metabolism to utilize nutrients 
available within the mammalian host. Quite surprisingly, we 
found that the majority of genes implicated in central and 
intermediary metabolism were expressed by leptospires at similar 
levels in DMCs and in vitro. We interpret these data to suggest that 
EMJH, the medium commonly used to cultivate pathogenic and 
saprophytic leptospires in vitro, reflects the overaU composition of 
nutrients available within mammalian host fairly weU. Nonethe- 
less, leptospires cultivated within DMCs differentially-regulated a 
handful of genes whose products are involved in metabolic and 
biosynthetic pathways, most notably, heme uptake and utilization 
(see below). Although increased temperature often is imphcated as 



an important stimulus for host adaptation, we observed very little 
overlap (<10%) between the cohort of genes that were 
upregulated in DMCs and those previously identified as being 
temperature-regulated in vitro [18,19,21,68]. Thus, differential 
gene regulation by leptospires within DMCs appears to be driven 
primarily by non-thermal mammalian host-specific stimuli. The 
relatively small pore size of the dialysis tubing used to cultivate 
leptospires within rat peritoneal cavities would exclude macro- 
molecules and most serum proteins but aUow for efiicient 
exchange of nutrients {i.e., glucose, ions, and free amino acids) 
present within serum. These are the same types of small molecules 
that leptospires likely encounter within proximal convoluted 
tubules, where the composition of the glomerular ultrafiltrate 
most closely resembles that of interstitial fluid [84]. Further 
experimentation is required to assess how closely DMC-cultivated 



Table 5. Candidate small non-coding RNAs identified by RNA-Seq. 



Transcript^ 


Homology^ 


E-value 


Chr 


Genome Coordinates 


sRNA size Validated^ 


LlClnclO 


tmRNA 


1 .50E-62 




1 75,606 


1 75,960 


355 


+ 


LIClnc20 


PyrR binding site 


1 .50E-04 




263,598 


264,013 


416 


+ 


LIClncSO 








849,634 


849,900 


267 


+ 


LlClncSO 








2,109,156 


2,109,444 


289 


+ 


LlOncSS 


Cobalamin 


3E-25 




2,878,556 


2,878,746 


191 




LIClnc60 


RNaseP 


5.50E-31 




3,031,445 


3,031,846 


402 


+ 


LiancSO 








4,015,037 


4,015,237 


201 




LIC2ncW 


Cobalamin 


6.80E-19 


2 


159,019 


159,243 


225 


+ 


LIC2nc20 






2 


242,735 


243,092 


358 


+ 


UC2nc30 






2 


246,062 


246,477 


416 


+ 


UC2nc40 






2 


348,946 


349,168 


223 




'Predicted sRNAs 


are annotated according to the genome 


of L. interrogans sv. Copenhageni (LIC) chromosome num 


ber followed by non 


coding RNA designation as 



Included in Supplementary Table S2. 
^Homology to known sRNA families is indicated as is the E-value when transcripts were searched against the Rfam database. 
^Expression was validated by reverse-transcriptase PCR in L. interrogans sv. Copenhageni strain RJ16441. 
doi:l 0.1 371/journal.ppat.l 004004.1005 
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LIC1nc80 




LI C1 3273 



Annotation 



UC13274 



Figure 6. IGB viewer of candidate sRNA LICIncSO. LICncSO was identified as an area of high transcriptional activity within an intergenic region 
of the genome of L interrogans sv. Copenhageni Fiocruz LI -130. Expression data for leptospires cultivated in DIVlCs (green) compared to those 
cultivated in vitro (IV, red) are indicated on the plus strand of the genome. Annotated genes on the relevant chromosome and nucleotide co- 
ordinates are indicated. The vertical "read count" scale is 0-100. 
doi:l 0.1 371 /journal.ppat.1 004004.g006 
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3- 0.10- 




L/CJ2675(Down 8.46-fold in DMCs) 
15.8-fold; p<0.0001 
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Figure 7. Validation of comparative RNA-Seq analysis. (A) qRT-pCR analysis of representative genes identified by RNA-Seq. Values represent 
the average transcript copy numbers for each gene normalized per lipL32 transcript. Bars indicate the standard error of the mean (SEM). Results 
presented are mean values from at least 3 biologically-independent samples of leptospires for each growth condition. The fold-regulation for each 
gene determined by RNA-Seq is indicated in parentheses. The fold-regulation between in vitro- (IV) and DMC-cultivated leptospires determined by 
qRT-PCR are indicated. P values were calculated using an unpaired f-test. (B) Correlation coefficient (R^) between RNA-Seq and qRT-PCR data. 
doi:l 0.1 371/journal.ppat.1 004004.g007 
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leptospires resemble their counterparts within host tissues during 
acute and/ or chronic infection. The DMC model does have some 
limitations. For instance, virulence genes associated with pulmo- 
nary haemorrhage may be expressed only within the context of 
lung tissue. Because bacteria within DMCs are prevented from 
interacting with host immune cells and immunoglobulin [85], this 
model does not enable us to identify genes that are differentially- 
regulated in response to specific pathogen-host interactions and/ or 
immune evasion. 

Although increased temperature often is implicated as an 
important stimulus for host adaptation, we observed very little 
overlap (<10%) between the cohort of genes that were 
upregulated in DMCs and those previously identified as being 
temperature-regulated in vitro [18,19,21,68]. We observed a 
similarly limited overlap between our RNA-Seq data and genes 
found to be differentially regulated in vitro in response to exposure 
to serum [21] and low iron [22]. We observed a somewhat higher, 
but nonetheless small, degree of overlap (16%) between our RNA- 
seq dataset and genes identified by Matsunaga et al. [20] as being 
upregulated by physiologic osmolarity (EMJH supplemented with 
120 mM NaCl); included in this overlap are lipL53 {LIC12099), 
sph2 (LIC12631), a putative CoA-transferase {LIC12322), phoD 
{UC13397) and hoi (LIC20148; see below). Thus, diflFerential gene 
regulation by leptospires within DMCs appears to be driven by 
mammalian host-specific stimuli that are not readily reproduced in 
vitro. 

The relatively small pore size of the dialysis tubing used to 
culti\'atc l<'])t()spires within rat peritoneal cavities would exclude 
macromolecules and most serum proteins but allows for efficient 
exchange of nutrients [i.e., glucose, ions, and free amino acids) 
present within serum. These are the same types of small molecules 
that leptospires likely encounter within proximal convoluted 
tubules, where the composition of the glomerular ultrafiltrate 
most closely resembles that of interstitial fluid [84]. Further 
experimentation is required to assess how closely DMC-cultivated 
leptospires resemble their counterparts within host tissues during 
acute and/ or chronic infection. The DMC model does have some 
limitations. For instance, virulence genes associated with pulmo- 
nary haemorrhage may be expressed only within the context of 
lung tissue. Because bacteria within DMCs are prevented from 
interacting with host immune cells and immunoglobuKn [85], this 
model does not enable us to identify genes that are difTerentiaUy- 
regulated in response to specific pathogen-host interactions and/ or 
immune evasion. 

To date, >20 named species oi Leptospira have been identified 
based on molecular taxonomic analyses [86]. Leptospira spp. can be 
further divided into three major groups based on pathogenicity: 
pathogenic (9 species), intermediate virulence (5 species) and free- 
living saprophytes (6 species). The vast majority (69%) of genes 
upregulated by leptospires in response to mammalian host signals 
are found only in pathogenic and intermediate virulence species 
{i.e., absent in L. bijlexa), suggesting that their gene products may 
help promote infection and/ or colonization within mammal. 
However, more than half (64/110) of these upregulated genes 
encode either hypothetical proteins or lipoproteins of unknown 
function without any obvious conserved/functional domains. 
While their functions remain to be determined, our finding that 
these protein-coding genes are difiFerentially-regulated in response 
to mammalian host-specific signals make them attractive candi- 
dates for further experimentation in animals model and, in 
particular, their potential use as part of a mono- or multi-valent 
protein-based vaccine. Thirt)r-five of the 56 genes downregulated 
in DMCs encode hypothetical proteins. Interestingly, all but 7 of 
these are unique to pathogenic and intermediate virulence species. 



raising the possibility that these genes products, while not required 
for survival within the host, facilitate the transition from a free- 
living to infective state. 

Heme is the major source of iron in L. interrogans and also serves 
as a cofactor for proteins essential for respiration (i.e., cyto- 
chromes), biosynthesis of vitamin B12, and detoxification of 
reactive oxygen intermediates {i.e., catalase). Unlike B. burgdorferi 
[87] and T. pallidum [88], L. interrogans possess a complete set of 
genes required for de novo heme biosynthesis as well as the uptake 
and utilization of exogenous heme [58,74,89]. By RNA-Seq, 
expression of 6 heme biosynthesis genes was significantiy 
downregulated in DMCs compared to in vitro, while heme 
oxygenase (LIC20148/holj andphuR, encoding a TonB-dependent 
heme receptor, were upregulated; these data support the notion 
that pathogenic leptospires preferentially use exogenously derived 
heme within the mammal. Of the four putative fur orthologs 
encodes by L. interrogans, only one {LLC12034) was upregulated in 
DMCs. Recentiy, Marcsisin et al. [46] demonstrated that 
inactivation of this gene had no effect on virulence in a hamster 
acute infection, implying that this Fur paralog is not responsible 
for downregulation of the heme operon within DMCs. Alterna- 
tively, downregulation of heme biosynthesis is not a prerequisite 
for survival in vivo. Because heme is highly toxic [90], there is 
relatively littie, if any, free heme within plasma [91]. In the 
glomerulus, the molecular weight cut-off for ultrafiltration is 
~70 kDa [92]. Thus, while L. interrogans is able to use haemoglobin 
(64 kDa) as a source of heme in vitro [60], this micronutrient is 
likely present in only minute amounts within the proximal tubules. 
Smaller molecules (^20 kDa), only the other hand, easily pass 
through the glomerulus into Bowmen's capsule; it is worth noting 
that this molecular weight cut-off is essentially equivalent to that of 
the dialysis tubing used for our DMCs (8 kDa MWCO). Instead, 
leptospires may be using myoglobin (16.7 kDa), which is present in 
human plasma at concentrations similar to that of haemoglobin 
[93]. Both h("moglobin and myoglobin, released by red blood cell 
turnover and muscle tissue damage, respectively, are filtered by the 
kidneys and would be available to leptospires within the renal 
tubules. 

Small non-coding RNAs (sRNAs) are increasingly recognized as 
essential post-transcriptional gen(; (-xpression regulators that 
enable bacteria to adjust their physiology in response to 
environmental cues [94]. Bacterial sRNAs range from 50 to 500 
nucleotides and frequentiy are located within intergenic regions 
[95]. By diverse mechanisms, including changes in RNA 
conformation, interactions with DNA, other RNAs and proteins, 
sRNAs can modulate transcription, translation, mRNA stability 
and DNA maintenance or silencing [96,97]. Five of the 11 
candidate sRNAs identified as part of this study are conserved in 
bacteria and known to carry out specific housekeeping functions, 
including RNase P (LLClncfiO), responsible for processing of 
tRNAs and other RNAs, and tmRNA (LLClnclO), which acts as 
both a transfer RNA (tRNA) and mRNA to tag incompletely- 
translated proteins for degradation and to release stalled proteins 
[76,77]. We also identified two cobalamin riboswitches {LIClnc55 
and LIC2nclff), which act as cir-regulatory elements in 5' 
untranslated regions of vitamin Bi2-related genes; aUosteric 
rearrangement of mRNA structure is mediated by ligand binding 
resulting in modulation of gene expression or translation of mRNA 
[78]. LIClne55 lies upstream oi LLC121374/btuB, which encodes a 
constitutively-expressed TonB-dependent outer membrane cobal- 
amin receptor protein [98] . We also identified a candidate sRNA 
[LIC2ncl(Pj upstream of LLC20135; although annotated as a 
ferredoxin, LIC20135 contains a domain conserved within 
sirohydrochlorinin cobalt chelatases, an important enzyme 
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involved in biosynthesis of vitamin B12. Finally, LIClnc20 contains 
a conserved PyrR binding site; this RNA element is found 
upstream of genes involved in pyrimidine biosynthesis and 
transport in Bacillus suhtilis [79]. In L. interrogans, this sRNA was 
found downstream of genes encoding hypothetical proteins. In 
addition to these known sRNAs, we identified six transcriptionally- 
active, non-coding regions that encode novel candidate regulatory 
sRNAs. UClncSO, UClncSO, UC2nc30 and UC2ne40 were all 
identified in the 5' untranslated regions for LIC14007, LIC10702, 
UC20192 and UC20276, respectively, aU of which encode 
proteins of unknown function. Tlu- remaining two putative sRNAs 
[LlClncSO and LIC2nc20) are located in the 3' untranslated region 
of gen(-s, which are known to be a repository of sRNAs in other 
bacterial species [99]. 

The L. interrogans genome encodes >200 proteins whose 
annotations suggest a role in transcriptional regulation [i.e., sigma 
factors, anti-sigma factors and trans-acting factors), two-compo- 
nent signal transduction and the synthesis/ degradation of cycUc 
nucleotides [11,12]. By RNA-Seq, the vast majority of these 
putative regulatory proteins were expressed at similar levels in vitro 
and in DMCs; this finding is not unexpected given that these types 
of regulator^' factors typically are activated at the protein level by 
endogenously- or exogenously-derived small molecules and 
environmental stimuli [100,101,102]. 

Recent advances in Leptospira molecular genetics, including the 
development of site-directed [103] and transposon-mediated 
[104,105,106] mutagenesis techniques, now make it possible to 
determine the contribution(s) of genes that are regulated within 
DMCs. We anticipate that this approach will identify proteins 
involved in environmental sensing, mammalian host adaptation 
and/ or the expression of specific virulence determinants in vivo. 

Materials and Methods 

Ethics statement 

AU animal experimentation was conducted following the Guide 
for the Care and Use of Laboratory Animals (Eighth Edition) and 
in accordance with protocol (ACC# 100570-0116) reviewed and 
approved by the University of Connecticut Health Center 
Institutional Animal Care and Use Committee. The UCHC 
laboratory animal care program is accredited by the Association 
for Assessment and Accreditation of Laboratory Animal Care. The 
USDA Site ID: Customer Number 44, Certificate Number 16-R- 
0025, PHS Assurance Number A3471-01. 

Bacteria 

Virulent low-passage Leptospira interrogans sv. Copenhageni 
strains Fiocruz LI- 130, kindly provided by Dr. David Haake 
(UCLA), and RJ 16441 were cultivated in vitro under standard 
conditions at 30°C in EMJH medium [107] supplemented with 
1% rabbit serum (Pel-Freez Biologicals, Rogers, AR) with 100 ng/ 
ml 5-fluorouracil. Cultures were passaged in vitro no more than 3 
times before being used for experimentation. 

Cultivation of virulent L interrogans within dialysis 
membrane chambers 

To obtain L. interrogans in a mammalian liost-adapt(xi state, 
organisms were cultivated in dialysis membrane chambers (DMCs) 
as previously described [23,24]. Briefly, DMCs were constructed 
using standard dialysis membrane tubing (Spectra-Por; 8000 
MWCO). Prior to use, 8-inch strips of dialysis tubing were tied off" 
at one end and then sterilized by boiling for 20 min in sterile water 
containing 5 mM EDTA, followed by two successive boiling 
washes in water alone. Dialysis bags were cooled to room 



temperature and then filled with ~8-9 mis of EMJH medium 
(supplemented with 10% vaccine-grade bovine serum albumin to 
maintain osmotic pressure) containing 1 0* organisms per ml. Once 
filled, the tubing was tied and excess membrane removed from 
both ends. For implantation, female Sprague-Dawley rats (150- 
174 g) were anesthetized by intramuscular injection of a mixture 
of ketamine (50 mg/kg), xylazine (5 mg/kg), and acepromazine 
(1 mg/kg). Using strict aseptic technique, a DMC was implanted 
into the peritoneal cavity of each rat. Analgesia (carprofen; 5 mg/ 
kg) was administered on the day of surgery and once the following 
day. At designated time points (typically 9-10 days post- 
implantation), rats were euthanized by CO2 narcosis and DMCs 
recovered. The contents of each chamber were removed by gentie 
syringe aspiration with an 18G needle; the needle was removed 
prior to expelling the DMC dialysate into a sterile 15 ml conical 
bottom tube. Bacteria were enumerated by dark field microscopy 
immediately following explant using a Petroff-Hausser counting 
chamber (Hausser Scientific Co., Horsham, PA). 

Gel electrophoresis and immunoblotting 

In arfro-cultivated L. interrogans, harvested at late-log phase 
(5 X lO**-! X 10^ per ml) and leptospires explanted from DMCs 
were processed for one- and two-dimensional SDS-polyacrylamide 
gel electrophoresis (ID and 2D SDS-PAGE, respectively) as 
previously described [8] . Protein concentrations were determined 
using the DC protein assay kit (Bio-Rad). Total protein separated 
by ID SDS-PAGE was detected by SYPRO Ruby protein gel stain 
(Sigma-Aldrich Inc, Ireland) as per manufacturer's instructions. 
Images were visualized with the BioSpectrum AC Imaging System 
(Ultra- Violet Products Ltd, UK). For immunoblotting, proteins 
were transferred to nylon-supported nitrocellulose, incubated with 
rabbit polyclonal antiserum directed against Sph2 [34], LipL32 
[38] and LipL41 [39] followed by goat anti-rabbit secondary 
antibody (Southern Biotechnology Associates, Birmingham, Ala.). 
Blots were developed using the SuperSignal West Pico chemilu- 
minescence substrate according to the manufacturer's instructions 
(Pierce, Rockford, 111.). 2D gels were loaded with 500 |Xg total 
protein and stained with silver as previously described [8] . 

RNA isolation, library preparation and RNA-Seq 

Total RNA was extracted using TRIzol reagent (Invitrogen) 
from three biologically-independent samples of (i) in ra'fro-cultivated 
leptospires or (ii) leptospires cultivated in DMCs (2 rats per sample) 
for 10 days as described above. Purified RNA was treated with 
Turbo DNA/rce (Ambion, Inc. Austin, TX) as previously described 
[108] to remove contaminating genomic DNA. The integrity of 
DNase-trcat(;d RNAs use for RNA-Seq were assessed using the 
Agilent Bioanalyzer RNA NanoChip (Agilent Technologies, 
Wilmington, DE) to ensure that each had an RNA integrity 
(RIN) value SB. One-hundred ng of total RNA was used for 
library generation according to Dlumina standard protocols 
(TruSeq RNA Sample Preparation Guide, Low-Throughput 
Protocol, Part # 15008136 Rev. A). cDNAs were normalized 
using a duplex-specific nuclease (DSN) approach according to the 
DSN Normahzation Sample Preparation Guide, Early Access 
Protocol, Part # 15014673 Rev. C, which decreases the 
prevalence of highly abundant transcripts, such as rRNAs. 76-bp 
paired-end sequencing was carried out by Sequensys (Prognosys 
Biosciences, La JoUa, USA) on an lUumina Genome Analyzer IIx 
according to the manufacturer's instructions. 

RNA-Seq data analysis 

Mapping of sequenced reads to Chromosome 1 and 2 of the 
reference genome of Leptospira interrogans sv. Copenhageni strain 
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Fiocuz Ll-130 (NCBI Reference Sequence: NC_005823.1 and 
NC_005824. 1 respectively) [1 1] was carried out using the software 
tool segemehl [109] with accuracy set to 100%. To increase 
coverage, mismatched nucleotides at the lower-quality 3' end were 
removed from the reads and the mapping was repeated until a 
match was found or the read length decreased below 20 
nucleotides (see [110]). Reads that mapped to (i) ribosomal or 
transfer RNAs or (ii) more than one reference genome location 
{e.g., paralogous genes) were discarded. Uniquely mapped reads 
{i.e., mapped to a single genomic location) were selected for further 
analysis, sue li as data visualisation and determination of differen- 
tial gen(; (-xpression. Normalization, differentially-expressed genes, 
regulator}' fold-changes and statistical significance were deter- 
mined using DESeq [43]. Read coverage used for graphical 
display was normalized as follows to compensate for different 
library sizes: the number of reads covering each nucleotide 
position was divided by the total number of mapped reads in the 
Ubrary and then multiphed with the number of mapped reads 
from the smallest library. Mapped unique reads were visualised 
with the Integrated Genome Browser (IGB, version 5.5) (http:// 
bioviz.org/igb/ releases.html) [111]. 

Bioinformatics 

Putative orthologous relations between proteins in other 
Leptospira serovars and/or species were determined using BlastP 
alignment (S40% amino acid identify over S80% of the length of 
the smallest protein) as previously described [14] . Protein sequence 
similarity between differentially-expressed genes identified in L. 
interrogans sv. Copenhageni and other Leptospira spp. {L. interrogans 
sv. Lai strain 56601 [80]; L. borgpetersenii sv. Hardjo strain L550 
[13]; L. santarosai sv. Shermani strain LT821 [15]; L. licerasiae sv. 
VariUal strain VAROlO [16]; and L. hiflexa sv. Patoc strain Patocl 
Ames [14]) was determined using GLSEARCH (version 35.04) 
from the FASTA package [112]. GLSEARCH identifies the 
optimal alignment across the entire genome of each strain, 
translated into ail six reading frames, and calculates the percent 
identity across the whole length of the corresponding sequence. 
Conserved domain searches were performed on fuU length protein 
coding sequences using the NCBI Conserved Domain Database 
interface [44,45] . The presence of fur boxes was investigated using 
the predictive computational tool SLiMSearch [62]. SLiMSearch, 
which can be used to determine the occurrences of a predefined 
motif in DNA and protein sequences, makes use of disorder and 
conservation masking to reduce the number of false positives. The 
fur box consensus sequence ([GC]AT[AT]AT[GC]AT[A- 
T]AT[GC]AT[AT]AT[GC]) used to search the genome of 
Lepto.spira interrogans sv. Copenhageni was based on that of [61]. 
Putative functions of candidate sRNAs were identified by BLAST 
using the Rfam database, Wellcome Trust Sanger Institute 
(http:// rfam.sanger.ac.uk/). 

Quantitative RT-PCR 

DNase-treated RNAs (~1 (ig per sample), isolated from 
leptospires grown to late-logarithmic phase at 30°C in vitro and 
within DMC, were prepared as described above and converted to 
cDNA using Superscript III (Invitrogen) in the presence and 
absence of reverse transcriptase (RT) according to the manufac- 
turer's instructions. cDNAs were assayed in quadruplicate using 
iQ^ Supermix (Bio-Rad) using the primer pairs described in Table 
SI. For relative quantitation of transcript levels, amphcons 
corresponding to each gene of interest were cloned into the 
pCR2.1-TOPO cloning vector (Invitrogen), then purified recom- 
binant plasmid DNAs for each amplicon were diluted (10-10^ 
copies/jtl) to generate a standard curve. Reaction conditions for 



each primer pair were optimized to ensure that each had an 
amplification efficiency of >90%. Transcript copy numbers for 
each gene of interest were calculated using the iCycler post-run 
analysis software based on internal standard curves then normal- 
ized against copies of lipL32 (LIC11352) present in the same 
cDNA. Normalized copy number values were compared within 
Prism v5.00 (GraphPad Software, San Diego, CA) using an 
unpaired i-test with two-tailed p values and a 95% confidence 
interval. 

Supporting Information 

Figure SI Comparison of leptospires cultivated in vitro 
and within DMCs by two dimensional SDS-PAGE 
revealed numerous polypeptide differences. Protein ly- 
sates prepared from L. interrogans sv. Copenhageni strain Fiocruz 
Fl-130 grown at 30°C in EMJH medium (top) or within dialysis 
membrane chambers (DMCs; bottom). Total protein (500 |J.g per 
gel) was solubilized in 7 M urea, 2 M Thiourea and 1% ASB-14 
and separated by two-dimensional gel electrophoresis as previously 
described [8]. Proteins were visualized with Lavapurple. 
(TIFF) 

Figure S2 Clustering of biological replicates. Heatmap 
representing the expression data for genes whose expression was 
either positively- or negatively-regulated by SLog2-fold (adjusted 
p-value^0.05) in DMC- versus in crfro-cultivated L. interrogans sv. 
Copenhageni strain Fiocruz Ll-130. 
(TIF) 

Figure S3 IGB viewer of putative sRNAs [LlClnclO - 
LIC2nc40) mapping to non-annotated regions of the 
genome. Candidate sRNAs were identified as areas of high 
transcriptional activity in intergenic regions of the genome of L. 
interrogans sv. Copenhageni Fiocruz Ll-130. Expression data for 
leptospires cultivated in DMCs (green) compared to those 
cultivated in vitro (IV, red) are indicated on plus strand of the 
genome. Annotated genes on the relevant chromosome and 
nucleotide coordinates are indicated. The vertical "read count" 
scale is 0-100. 
(TIFF) 

Figure S4 VaUdation of RNA-Seq analysis. qRT-PCR 
analysis of the entire panel of genes used to validate RNA-Seq data 
derived from L. interrogans sv. Copenhageni cultivated in EMJH at 
30°C in vitro (IV) and within DMCs. Values represent the average 
transcript copy number for each gene normalized per copy of 
lipL32. Bars indicate the standard error of the mean (SEM). 
Results presented are mean values from at least 3 biologically- 
independent samples of leptospires for each growth condition. The 
fold-regulation for each gene determined by RNA-Seq is indicated 
in parentheses. The folds of regulation between in vitro- and DMC- 
cultivated leptospires determined by qRT-PCR are indicated. P 
values were calculated using an unpaired Hest. 
(TIF) 

Table SI Oligonucleotide primers used in these studies. 

(DOCX) 

Table S2 Gene expression data for Chromosome 1 and 

2. Column A: G(-n(- identification. Column B: Mean number of 
DESeq values for each gene in all 6 biological replicates. Column 
C: Mean DESeq values for each gene in 3 biological replicates of 
leptospires cultured in vito (IV), Column D: Mean DESeq values 
for each gene in 3 biological replicates of leptospires cultured in 
dialysis membrane chambers (DMC), Column E: Fold change 
gene expression by DMC leptospires compared to IV leptospires. 
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Column F: Log2 fold change gene expression by DMC leptospires 
compared to IV leptospires, Column G: p-value of differential 
gene expression, Column H: adjusted p-value of differential gene 
expression, Column I: Residual variance of DESeq values for each 
gene in three biological replicates of IV, Column J: Residual 
variance of DESeq values for each gene in tiux-c biological 
replicates of DMC, Column L-Q: DESeq values for each gene in 
each biological replicate. Column S: DNA strand location for each 
gene. Column T: location of gene on positive or negative strand. 
Column U: Start position for each gene, Column V: End position 
for each gene, Column W: gene length, Column X: gene name. 
Column Y: gene product. Datasheets are arranged to present (1) 
data for all genes sorted according to the expression values in 
DMCs; (2) data for all genes sorted according to the expression 
values in vito; (3) data for genes whose expression was upregulated 
within DMCs compared to in vitro; and (4) data for genes whose 
expression was downregulated within DMCs compared to in vitro. 
PCLSX) 

Table S3 Top 100 protein-coding genes expressed by L. 
interrogans within DMCs. 

(DOCK) 

Table S4 Expression data for individual genes and 
pathways highlighted in the manuscript. 

(DOCX) 
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